The critical effect of dependency groups on the function of networks 
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Current network models assume one type of links to define the relations between the network 
entities. However, many real networks can only be correctly described using two different types of 
relations. Connectivity links that enable the nodes to function cooperatively as a network and de- 
pendency links that bind the failure of one network element to the failure of other network elements. 
Here we present for the first time an analytical framework for studying the robustness of networks 
that include both connectivity and dependency links. We show that the synergy between the two 
types of failures leads to an iterative process of cascading failures that has a devastating effect on 
the network stability and completely alters the known assumptions regarding the robustness of net- 
works. We present exact analytical results for the dramatic change in the network behavior when 
introducing dependency links. For a high density of dependency links the network disintegrates in a 
form of a first order phase transition while for a low density of dependency links the network disin- 
tegrates in a second order transition. Moreover, opposed to networks containing only connectivity 
links where a broader degree distribution results in a more robust network, when both types of links 
are present a broad degree distribution leads to higher vulnerability. 



Many friendships between individuals in a social net- 
work, numerous business connections in a financial net- 
work or multiple cables between Internet routers, are all 
examples of networks with a high density of connectivity 
links [lMlOj. Such networks are regarded as very stable 
to attacks since even after a failure of many nodes the 
network still remains connected. In contrast, dependen- 
cies between the network nodes endanger the network 
stability since the failure of several nodes may lead to 
the immediate failure of many others. As an example 
consider a financial network: Each company has trading 
and sales connections with other companies (connectivity 
links). These connections enable the companies to inter- 
act with each other and function together as a global fi- 
nancial market. But there are also dependencies relations 
between companies, several companies that belong to the 
same owner depend on one another. If one company fails 
the owner might not be able to finance the other compa- 
nies that will fail too. Such dependencies jeopardize the 
network stability and are the possible cause of many ma- 
jor financial crises. Another example is an online social 
network (Facebook or Twitter): Each individual commu- 
nicates with his friends (connectivity links), thus forming 
a social network through which information and rumors 
can spread. However, many individuals will only partici- 
pate in a social network if other individuals with common 
interests also participate (dependency links) in that so- 
cial network. 

The effect of failing nodes on the network stability has 
been studied se par ately for networks containing only con- 
nectivity links 111 Hi and for networks containing only 
dependency links [17H2l| . The fundamental difference 
between connectivity and dependency links is that for 
dependency links the failure of a direct neighbor of a 
node leads to the direct failure (with some probability) 



of that node, but for connectivity links a node fails only 
when it (or the cluster it is in) becomes completely dis- 
connected from the network. Percolation theory is a ma- 
jor tool for studying network stability when the network 
is connected only with connectivity links. In a percola- 
tion process on a network of size N , a fraction 1 — p of 
the network nodes are removed. If the remaining frac- 
tion of nodes, p, is larger then a critical value (p > p c ), 
a spanning cluster connecting order N nodes exists, if 
however, p < p c , the network collapses into small clus- 
ters. At p = p c the network undergoes a second order 
phase transition Ill4l6j. 

Previous studies of networks containing dependencies 
can be divided into two categories: (i) Overload failures 
in networks containing a flow of a physical quantity. For 
example, disturbances in power transmission systems or 
congestion inst abil i ties in transportation networks and 
Internet traffic 
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- 2(3]. These models show that when 
one node is overloaded and the traffic cannot be routed 
through it, choosing alternative paths will cause other 
nodes to also become overloaded. This process may de- 
velop into a series of cascading failures that can disable 
the entire network, (ii) Models based on local dependen- 
cies, such as decision making of interacting agents [2lj |. 
In these models the state of a node depends on the state 
of its neighbors and therefore a failing node will cause 
it's neighbors to also fail and so on. 



RESULTS 

Here we present an analytical framework for studying 
the robustness of networks that include both connectiv- 
ity and dependency links. When nodes fail in a network 
containing both types of links, two different processes 
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dependency failure 



FIG. 1: Demonstration of the synergy between the percola- 
tion process and the failures caused by dependency links (de- 
pendency process) that lead to an iterative process of cascad- 
ing failures. The network contains two types of links: connec- 
tivity links (solid lines) and dependency links (dashed lines), 
(a) The process starts with the initial failure of two nodes 
(marked in red). The connectivity links connected to them 
also fail (marked in red), (b) Percolation process - in this 
stage all the nodes and the connectivity links that are con- 
nected to them, that are not connected to the giant cluster 
(largest cluster) by connectivity links also fail (marked in red) . 
(c) Dependency process - the nodes that depend (connected 
by dependency links) on the failing nodes also fail (marked in 
red), (d) The next step of connectivity failure in which two 
more nodes fail because they are not connected to the largest 
cluster (currently containing only two nodes). 



occur, (i) Connectivity links are disconnected, causing 
other nodes to disconnect from the network (percolation 
process), (ii) Failing nodes cause other nodes that de- 
pend on them to also fail even though they are still con- 
nected via connectivity links (dependency process). We 
show that the synergy between the percolation process 
and the dependency process leads to a cascade of failures 
that can fragment the entire network (FigO}. We find 
that the density of dependency links, q, plays a key role 
in determining the robustness of such networks. For net- 
works containing connectivity links and a high density of 
dependency links, an initial failure of even a small frac- 
tion of the network nodes disintegrates the network in a 
form of a first order phase transition. 

If however, the fraction of dependency links is reduced 
below a certain threshold, q Cl the network disintegrates 
in a form of a second order phase transition. The cas- 
cading process leading to a first order transition exists 
for a wide range of topologies including lattices, ER and 
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FIG. 2: (a) Simulation results showing the first and second 
order phase transitions in lattice, ER and SF networks. The 
fraction of nodes in the giant component at the end the cas- 
cade process, , is shown as a function of p for q = 1 (filled 
symbols) and for q — (open symbols), where q is the fraction 
of dependent nodes. For q = 1, Qoo abruptly drops to zero at 
the transition point characterizing a first order transition. For 
9 = 0, Qfoo gradually approaches zero as expected in a second 
order transition. The SF (circle) and ER (square) networks 
presented both have the same average degree of (k) = 3.5. 
Thus, SF networks that are most robust when only connec- 
tivity links exist (very low transition point for q — 0) become 
most vulnerable when dependency links are added (very high 
transition point for q = 1). (b) The transition points, p 1 for 
the first order region (solid line) and p 11 for the second or- 
der region (dashed line) are plotted as a function of q (the 
fraction of dependent nodes) for ER (squares) and SF (cir- 
cles) networks with the same average degree (k) — 4. For ER 
networks theoretical results (confirmed by simulation results) 
are obtained according to Eq.© and Eq.Q presented in the 
paper. For SF networks simulation results of a network with 
A = 2.9 are presented since exact theoretical results are not 
available. 



SF networks, indicating it is a general property of many 
networks (FigJ^a)). Comparing networks with both con- 
nectivity and dependency links but with different topolo- 
gies, reveals a new relation between topology and the 
robustness to random failure: Networks with a broader 
degree distribution of connectivity links are more vul- 
nerable to random failure in the presence of dependency 
links. This is opposed to the known result for networks 
containing only connectivity links, where networks with a 
broader degree distribution are significantly more robust 
to random failures. Figl^a) and Fig|3IR) show that when 
comparing ER and SF networks with the same average 
degree, SF networks with a high density of dependent 
nodes are more vulnerable to random failures then ER 
networks. 

FORMALISM 

Next we present an analytical approach for studying 
the robustness to random failure of networks containing 
the two types of links. Without loss of generality we 
define a model in which only pairs of nodes depend on 
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one another, forming dependency groups of size 2. When 
the dependency group contain more then two nodes the 
cascade effect is even more extreme and the transition 
from the regular second order percolation transition to a 
first order transition occurs even for more stable networks 
(see SI). Therefore, the new properties we present for the 
case of dependency groups of size 2 are also valid in the 
general case of larger dependency groups (see Fig.l in SI). 
The model is defined as follows: A network containing N 
nodes is randomly connected by connectivity links with 
a degree distribution P(k) and an average degree (k). 
In addition, pairs of nodes are connected by dependency 
links as follows: a) A node can only have one dependency 
link, b) If node i depends on node j then node j depends 
on node i. For this model we denote by q the fraction of 
nodes that have dependencies. 

We start by presenting the formalism describing the 
iterative process of cascading failures for the simple case 
of q = 1 (see Fig.l in SI). Each iteration (step) includes 
failures that are the result of the percolation process and 
failures that are the result of the dependency process. 
The goal of the formalism is to describe the accumulated 
process up to step n as an equivalent single random 
removal, r n , from the original network. The remaining 
fraction of nodes after such a removal is j3 n = 1 — r n . 
The new network after the removal of a fraction r„ 
of the nodes, has a giant component consisting of a 
fraction g{P n ) of the remaining nodes which is a fraction 
c^n+i = Png(Pn) from the original network. 
The iterative process is initiated by the removal of a 
fraction tq = 1 — p of the network nodes. The remaining 
part of the network is Po = P- This initial removal will 
cause additional nodes to disconnect from the giant 
cluster due to the percolation process. The fraction 
of nodes that remain functional after the percolation 
process is a\ — Pog{Po)- Each node from the non 
functional part (1 — ai) will cause the node that 
depends on it to also fail (dependency process). The 
probability that a node depending on a non functional 
nodes has survived until now is a\. Therefore the 
fraction of new nodes that will fail due to dependencies 
is <5i = (1 — oti)ai. The accumulated failure includ- 
ing the initial failure of 1 — po and 5± is equivalent 
to a random removal of r± = (1 — flo) + (1 — Qu)A) 
from the original network (see SI). The remaining 
fraction of nodes after the new removal is therefore 
Pi = 1 — ri = PqUi — Pog(Po)- The remaining functional 
part of the giant component is now ot2 = Pig(Pi)- To 
calculate the fraction 82 of nodes that are disconnected 
due to dependencies at the second stage, recall that at 
the previous stage a fraction failed from a\. The 
remaining part of ot\ was therefore ol\ — 81 = af . Thus 
62 = (a^/cq )(af — a%) = [1 — {0*2/ & 2 )]ot2- This is equiva- 
lent to a random removal of r 2 = (1 — Pi) + [1 — (a^/cnf)]^! 
from the original network. The remaining fraction of 
nodes is /?2 = 1 — T2 = ot2<^\/ Pi — P^giPi)- Following 



this approach we can construct the sequence, P n , of the 
remaining fraction of nodes in the network after each 
iteration. 

Po=P- 

pi=p 2 g(p ). 

p 2 = P 2 g(Pi)... 

Pn =p 2 g(Pn-l). 

Following a similar approach for the general case of < 
q < 1 (see SI) yields the sequence /?„ = qp 2 g(p n -i) + 
p(l — q). Given, /3„, the fraction of nodes in the giant 
cluster is a n+ i = P n g(P„) = p(l - q{l - pg{P n )))g(p n ). 
Fig.EJa) compares theory and simulations of q„, for the 
case of an ER network. 




FIG. 3: (a) Comparison between simulations and theoretical 
results for the fraction of nodes in the giant cluster on every 
step n of the iterative process of failing nodes. The results 
are shown for an ER network with q = 0.8 and p = 0.84 
(p ~ p 1 ). The theoretical results (line) are calculated accord- 
ing to Eq.JTJ (the explicit form of g(x) is presented in the 
text) and are compared to several realizations of computer 
simulations on networks of size N = 200K. (b) The number 
of iterative failures (NOI) are shown for a scale free network 
with A = 2.7 and 3=1. At the first order transition point, 
the number of iterative failures that the network undergoes 
before disintegrating scales as N 1 ^ 4 (see SI). This number 
sharply drops as the distance from the transition is increased. 
Thus, plotting the number of iterations as a function of p pro- 
vides a useful method for identifying the transition point, p 1 , 
at the first order region. The inset shows that the size of the 
second largest cluster reaches its maximum value at the sec- 
ond order transition point, p 11 , therefore providing a useful 
method for identifying p 11 at the second order region. 

To determine the state of the system at the end of the 
cascade process we analyze p n at the limit of n — > 00. 
This limit must satisfy the equation P n = Pn+i since at the 
end of the process the cluster is not further fragmented. 
Denoting p n = P n —i — x we arrive to the equation: 

X =p 2 qg(x) +p(l - q); (1) 

This equation can be solved graphically as the intersec- 
tion of a straight line y = x and a curve y — p 2 qg(x) + 
p(l ~ q). When p is small enough the curve increases 
very slowly and does not intersect with the straight line 
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(except at the origin which corresponds to the trivial so- 
lution) . The critical case for which the nontrivial solution 
emerges, corresponds to the case when the line touches 
the curve at a single point x and in this point we have 
the condition 1 — p 2 q^(x), which together with Eq.([TJ 
gives the solution for the critical fraction of failing nodes 
that will fragment the network and the critical size of the 
giant component. 



ANALYTICAL SOLUTION 



An exact analytical solution can be obtained using the 



transition (p — p 11 ). Solving system ([2]) for / — > 1 yields: 



apparatus of generating functions. As in Refs. 22-24 
we introduce the generating function of the degree distri- 
bution Go(£) = Ylk P(k)t; k ■ Analogously, we also intro- 
duce the generating function of the underlying branching 
process, Gi(£) = G' (£)/G' (l). A random removal of a 
fraction 1 — p of nodes will change the degree distribu- 
tion of the remaining nodes, so the generating function of 
the new distribution is equal to the generating function 
of the original distribution with the argument £ replaced 
by 1 — p(l — £) (22| . The fraction of nodes that belong 
to the giant component after the removal of 1 — p nodes 
is g(p) = 1 — Gq[1 — p(l — /)], where / = f(p) satisfies a 
transcendental equation / = Gi[l — p(l — /)] [24$ . 

In the case of an ER network with a Poisson degree 
distribution (IlT[l3| , the problem can be solved explicitly 
since Gi(£) = G (£) = exp((fc) (£ - 1)). Accordingly, 
g(x) = 1 — / and / — exp[(fc) x(f — 1)] where x is denned 
in Eq.([T]). The fraction of nodes in the giant component 
at the end of the cascade process is then given by = 
/W/3oo) = p(l - 9(1 -p(l -/)))(!-/)• The equation 
/ = f(q,p,k) has a trivial solution at / = 1. The non- 
trivial solutions of / can be presented by the crossing 
points of the two curves in a system of equations that 
are given with respect to x and /: 



x=p 2 q(l- f)+p(l-q) 
X " (*)(/-!)• 



,0</<l (2) 



For the trivial solution at / = 1 the size of the giant 
component is zero (a^ — 0). For the solutions that 
are the crossing points of the two curves, / < 1, i.e., 
Ooo > 0. Thus, the case where the curves tangentially in- 
tersect corresponds to a first order phase transition point 
(p = p 1 ) where abruptly jumps from a finite size 



p n (k)(l-q) = l 



(4) 



The analysis of Eq.(|3]) and Eq.(|4]) shows that the first or- 
der transition at p = p 1 occurs for networks with a high 
density of dependency links (q > q c ), while the second or- 
der transition at p = p 11 , occurs for networks with a low 
density of dependency links (q < q c ). This is confirmed 
by Fig. 4(a) that compares theory and simulations for 
p II {q) and p 7 (q). The critical value of q c (and p c ) for 
which the phase transition changes from first order to a 
second order is obtained when the conditions for both 
the first and second order transitions are satisfied simul- 
taneously. Applying both conditions we obtain 



q c = «fc> + 1- ^2 (fc) + 
p c = l/(V2<fc) + l-l). 



SIMULATIONS 



(5) 



Next, we support our analytical results by simulations. 
Finding the transition point via simulations is always a 
difficult task that requires high precision. In the case 
of the first order transition we are able to calculate the 
transition point with good precision by identifying the 
special behavior characterizing the number of iterations 
(NOI) in the cascading process. At the first order tran- 
sition point, the NOI scales as A 1 / 4 (see SI) which is 
also demonstrated by the long plateau in Fig.[3][a). This 
number sharply drops as the distance from the transition 
point is increased, since away from the transition point, 
p 1 , the NOI scales as log N/ip-p 1 ) (see SI). Thus, plot- 
ting the NOI as a function of p, provides a useful and pre- 
cise method for identifying the transition point p 1 at the 
first order region. For the second order region a similar 
behavior exists for the size of the second largest cluster 
which also reaches its maximum at the transition point 
[lfj. Fig. ^b) presents simulation results of the NOI. 
The transition point, p 1 , can easily be identified by the 
sharp peek characterizing the transition point. The inset 
of Fig.[3]Jb) presents a similar behavior for the size of the 
second largest cluster near the second order transition 



above p 1 to zero below p 1 ||. The condition for the P oint > P - ^ 4 (^) compares simulation results and 
first order transition is that the derivatives of the equa- 
tions of system ([2]) with respect to / are equal. Together 
with system @ this yields: 

(P 1 ? (k) q = -l/[(/ - 1)/] + In //(/ - I) 2 (3) 

However, for a solution of system ([2]) where / — > 1 
(ctoo — 0) there is no jump in the size of the gi- 
ant cluster and thus the transition is a second order 



theory for the transition points p I (q) at the first order 
region (solid line) and p II (q) at the second order region 
(dashed line) . The transition points were obtained using 
the NOI and the second cluster size techniques respec- 
tively. The theoretical results for different values of q 
and (k) were calculated by solving system (f2|) together 
with Eq.([3l) or Eq.flJ) respectively. Fig. EJb) compares 
the values of the transition points p 1 (q) and p II (q) re- 
spectively between SF and ER networks with the same 
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FIG. 4: (a) Theory (lines) and simulations (symbols) are 
compared for the values of p I {q) and p II {q) for ER networks 
with different values of (k). For q > q c the network under- 
goes a first order transition, therefore the theoretical values 
of the transition point, p 1 (q), that are calculated according 
to Eq.(3) are compared with simulations performed using the 
NOI method (explained in text). For q < q c the network 
undergoes a second order transition, therefore the theoretical 
values of the transition point, p 11 (q), that are calculated ac- 
cording to Eq.(4) are compared with simulations performed 
using the second largest cluster method (explained in text). 
The line separating between the first and second order is ob- 
tained according to Eq.(5). (b) Comparison between simu- 
lation (symbols) and theory (lines) for as a function q 
for different values of (k). Qoo, at the phase transition point 
is finite for a first order transition and a zero fraction for a 
second order transition. 

average degree. For networks with a small fraction of de- 
pendencies (second order transition region) SF networks 
are more robust to random failure (lower p H ). For net- 
works with a high fraction of dependencies (first order 
transition region) SF networks become more vulnerable 
(higher p 1 ). Fig. 4(b) compares simulation and theory 
for aoo, the fraction of nodes in the giant cluster at the 
transition point. Above q c , is finite characterizing 
a first order transition, while below q c , ««, is zero as 
expected for a second order transition. 

DISCUSSION 

Here we show that in order to properly model real net- 
works two different type of links are needed: connectivity 
links and dependency links. We present an analytical for- 
malism for a general network model including both con- 
nectivity and dependency links. According to our model, 
networks with high density of dependency links are ex- 
tremely vulnerable to random failure and when a critical 
fraction of nodes fail the network disintegrates in a form 
of a first order phase transition. Networks with a low 
density of dependency links are significantly more robust 
and disintegrate in a form of a second order phase tran- 
sition. In the limit of zero fraction of dependency links 
our general solution yields the known results for networks 
with only one type of links. Our framework also provides 



an analytical solution for the critical density of depen- 
dency links for which the phase transition changes from 
a first order to a second order percolation transition. We 
develop a powerful simulation method for accurately es- 
timate the transition point, based on the unique behavior 
of the NOI (number of iterations in the iterative process 
of cascading failures) that diverges at the first order tran- 
sition point. Using this method we are able to provide 
very accurate simulation results supporting our analyti- 
cal results. 
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